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Historically, coronaviruses have been recognized as a cause of minor respiratory infections in 
humans. However, the recent identification of three novel human coronaviruses, one causing severe 
acute respiratory syndrome (SARS), has prompted further examination of these viruses. Previous 
studies of geographically and chronologically distinct Human coronavirus 229E (HCoV-229E) 
isolates have found only limited variation within S gene nucleotide sequences. In contrast, analysis 
of the S genes of contemporary Human coronavirus OC 43 variants identified in Belgium revealed 
two distinct viruses circulating during 2003 and 2004. Here, the S and N gene sequences of 
25 HCoV-229E variants identified in Victoria, Australia, between 1979 and 2004 in patients 
with symptomatic infections were determined. Phylogenetic analysis showed clustering of the 
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isolates into four groups, with evidence of increasing divergence with time. Evidence of positive 
selection in the S gene was also established. 


Coronaviruses are enveloped, positive-stranded RNA viruses 
with a genome of approximately 30 kb (Lai, 1990). In 
animals they are significant veterinary pathogens, often 
causing severe disease (Holmes, 2001). Until recently, only 
two human coronaviruses were recognized, Human corona- 
virus 229E (HCoV-229E) and Human coronavirus OC43 
(HCoV-OC43). These viruses have generally been associated 
with symptoms of the common cold (McIntosh, 1996) but 
there is growing evidence that they occasionally cause more 
severe infections (Pene et al., 2003; Vabret et al., 2003; Birch 
et al., 2005). In late 2002, a third human coronavirus (SARS- 
CoV) was implicated as the aetiological agent of severe acute 
respiratory syndrome (SARS) (Drosten et al., 2003). Since 
then, two more human coronaviruses have been identified, 
HCoV-NL63 associated with upper and lower respiratory 
tract infections (van der Hoek et al., 2004) and HCoV- 
HKU 1 in patients with pneumonia (Woo et al., 2005). Based 
on their serological and genetic properties, the human 
coronaviruses are assigned to two of three existing antigenic 
serogroups: HCoV-229E and HCoV-NL63 in serogroup 1 
(Siddell, 1995; van der Hoek et al., 2004); HCoV-OC43, 


The GenBank/EMBL/DDBUJ accession numbers for the sequences 
reported in this paper are DQ243940-DQ243962 and DQ243964— 
DQ243987. 


A figure showing the phylogenetic analysis using PHYLIP on HCoV- 
Q29E isolates and strains identified between 1972 and 2004, and 
tables showing primers used and their nucleotide position relative to the 
HCoV-229E complete genome and the summary of amino acid 
changes in the S and N proteins of HCoV-229E variants studied are 
available as supplementary material in JGV Online. 


HCoV-HKU1 and, most likely, SARS-CoV in serogroup 2 
(Siddell, 1995; Snijder et al., 2003; Woo et al., 2005). 


Coronaviruses contain four structural proteins, spike (S), 
membrane (M), small envelope (E) and nucleocapsid (N) 
(Brian & Baric, 2005). A fifth, the haemagglutinin-esterase 
(HE) protein, exists in some serogroup 2 coronaviruses 
(Holmes & Lai, 1996). Limited functional and structural 
information exists for the HCoV-229E S protein, although 
receptor-binding activity is associated with the S1 subunit 
(Bonavia et al., 2003). In the related non-human Infectious 
bronchitis virus, S1 is involved in the induction of neutraliz- 
ing, serotype-specific and haemagglutination inhibiting 
antibodies (Cavanagh et al., 1988). 


Both recombination and point mutations have the capacity 
to drive coronavirus evolution (Navas-Martin & Weiss, 2003). 
Although recombination has not been detected for HCoV- 
229E or HCoV-OC43, a recombination breakpoint in the 
SARS-CoV polymerase has been described (Rest & Mindell, 
2003). Few studies on variation in the human coronavirus S 
protein have been reported. One study of three geographi- 
cally and chronologically distinct HCoV-229E isolates found 
limited variation within S gene nucleotide sequences (Hays 
& Myint, 1998). In contrast, two distinct variants of HCoV- 
OC43 circulated in Belgium between 2003 and 2004 (Vijgen 
et al., 2005). 


Prior to 2001, in our laboratory coronaviruses were identi- 
fied using a method involving isolation from human fibro- 
blasts, followed by confirmatory immune electron microscopy 
where these agents were suspected. Virus isolation attempts 
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on respiratory specimens were discontinued in 2001 and 
replaced by a RT-PCR capable of differentiating between 
HCoV-229E and HCoV-OC43 (Birch et al., 2005). RNA was 
extracted from virus isolates and clinical material by using a 
High Pure Viral Nucleic Acid kit (Roche Diagnostics) and 
reverse transcribed using avian myeloblastosis virus reverse 
transcriptase (Promega) at 42°C for 1h using random 
hexamers. The full S and N gene sequences of HCoV-229E 
were amplified in a nested PCR (Supplementary Table S1 
available in JGV Online) using a Qiagen Taq DNA polymerase 
kit (Qiagen). The cycling programmes consisted of an initial 
denaturation of 4 min at 94°C, followed by 40 cycles (first 
round) or 25 cycles (second round) of 30 s at 94°C, 30 s at 
60 °C and 130 s for the S gene or 90 s for the N gene at 72 °C, 
with a final extension of 7 min at 72°C. RT-PCR products 
were purified, sequenced in both directions using a cycle 
sequencing reaction (ABI Prism Big Dye Terminator Cycle 
Sequencing Ready Reaction kit; Perkin-Elmer) and analysed 
by using an ABI 3730S capillary sequencer. 


Nucleotide sequences were analysed and amino acid 
sequences determined using BIOEDIT sequence alignment 
editor version 7.0.1 (Hall, 1999). Alignments of nucleotide 
sequences were made with Multalin (Corpet, 1988) and 
manual editing of alignments was performed using Gene- 
doc version 2.5 (Nicholas & Nicholas, 1997). Expected 
transition/transversion ratios and gamma distribution 
parameter alpha were estimated using TREEPUZZLE version 
5.2 (Schmidt et al., 2002). Phylogenetic trees based on the 
optimum alignment were constructed using DNAdist 
and neighbour-joining method with PHYLIP version 3.63 
(Felsenstein, 1993) and parameters estimated from 
TREEPUZZLE. Unrooted phylograms were drawn with 
TREEVIEW version 1.5 (Page, 1996). Tests of selection were 
conducted using MEGA version 3.1 (Kumar et al., 2004). The 
codon-based model of Nei—Gojobori was used to compare 
synonymous (dS) and non-synonymous (dN) distances. 
Assumptions tested were purifying (dN<dS), neutral 
(dN =dS) and positive (dN > dS) selection. The probability 
computed was <0-05 for hypothesis rejection at the 5% 
level. Potential N-glycosylation sites in the HCoV-229E S$ 
protein were predicted using the NetNGlyc 1.0 Server at 
the Centre for Biological Sequence Analysis (http://www. 
cbs.dtu.dk/services/NetNGlyc/). 


Between 1979 and 2004, HCoV-229E was isolated or detected 
by RT-PCR in 25 patients aged from 0-3 to 58 years of age 
(Table 1). Patients suffered from symptoms varying from 
mild upper respiratory tract infection to pneumonia, and 
three (patients 12, 18 and 19) were hospitalized. Several 
presented with fever, fatigue and cough, consistent with a 
case definition of influenza-like illness (Clothier et al., 
2005). A total of 25 HCoV-229E sequences were analysed. 
Twenty-one were full-length S gene sequences, 13 from virus 
isolates and eight directly from clinical material. Partial S 
gene sequences were obtained from two samples of clinical 
material (patients 19 and 25 in Table 1). S gene sequences 
could not be obtained from the viruses detected in patients 


18 and 24. All S gene sequences available were compared to 
an ATCC prototype strain (HCoV-229E 1973; GenBank 
accession no. DQ243963, containing 3522 nt, 1173 aa). 


The HCoV-229E S gene encodes a 15 aa signal sequence, 
an N-terminal $1 domain (codons 16-560), an $2 domain 
(codons 561-1173) containing several heptad repeats, a 
transmembrane domain (codons 1117-1138) and a cyto- 
plasmic tail (codons 1139-1173) (Bonavia et al., 2003). The 
21 complete S gene sequences each contained 3513 nt 
(1170 aa). Deletion of 9 nt occurred in two discrete posi- 
tions in each of these sequences, at 681-683 and 1057-1062. 
Without exception this resulted in 3 aa deletions compared 
to the prototype strain, aspartic acid at codon 228 and valine 
and tyrosine at codons 353 and 354, respectively. The partial 
sequences comprised nt 1-957 and 1806-3513 (patient 19) 
and 1-1788 (patient 25). The deletion pattern described 
above for full-length sequences also applied to these variants 
within the constraints of the sequence information available. 
Comparison of the full or partial S amino acid sequences 
with the prototype sequence identified 67 individual codon 
differences and the three deletions described above. The 
deletions and most dN changes occurred in the $1 domain, 
between aa 16 and 560. Thirty of 54 aa substitutions detected 
in this region were non-conservative compared with 7 of 13 
substitutions identified outside the $1 domain (Table 2 and 
Supplementary Table S2 available in JGV Online). The 
putative N-glycosylation pattern (Asp—X—Ser/Thr) of the S 
protein of all variants differed slightly compared with the 
prototype. The 24 N-glycosylation sites in the prototype 
strain were also present in the variants, but two additional 
sites were generated in variants as a result of mutations to 
asparagine at codons 111 and 488. Binding of HCoV-229E 
to its cellular receptor human aminopeptidase N occurs 
through interaction with aa 417-517 of the S1 domain 
(Bonavia et al., 2003). Seven amino acid substitutions 
occurred in this region (Table 2). Three of them (Q430K, 
D444N and K488N) were non-conservative and included 
the generation of an N-glycosylation site at aa 488. 


Compared with the S protein of the HCoV-229E 1973 
reference isolate, many substitutions and each of the 
deletions were present in both clinical isolates and directly 
sequenced specimens. This suggests that passage in cell cul- 
ture has not substantially biased the analysis of the sequences 
studied. Additionally, we were able to compare the sequence 
of a virus isolate from 1999 (strain 13-18; GenBank acces- 
sion no. AY386392) with variants we obtained from that 
time onwards. Of 11 aa changes between our pre-1999 iso- 
lates and directly sequenced variants, eight were also present 
in the 1999 isolate. Overall, this indicated that the influence 
of cell passage on the sequences studied was limited. 


The entire sequence of the N gene was obtained for 23 of 25 
HCoV-229E variants; compared with the S gene, there was 
less heterogeneity in this gene and the most divergent 
variant (from patient 22 in Table 1) differed by only 25 nt 
(9 aa) from the 1973 prototype strain (HCoV-229E 1973; 
GenBank accession no. DQ243939). Seven amino acid 
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Table 1. Details of the patients involved and specimens from which HCoV-229E was either isolated (patients 1-13) or 
detected by RT-PCR directly from supplied clinical material (patients 14-25) 


Patient Age Gender Strain name Specimen Symptoms GenBank GenBank 
no. (years) type* accession no. _—accession no. 
S gene N gene 

1 NAT NA HCoV-229E-11/6/79 NA NA DQ243964 DQ243940 
2 14 Female HCoV-229E- 16/6/82 NTS URTI# DQ243965 DQ243941 
3 5 Male HCoV-229E-22/9/82 NTS URTI DQ243966 DQ243942 
4 41 Female HCoV-229E-6/10/82 NPA URTI DQ243967 DQ243943 
5 28 Female HCoV-229E-21/10/82 NTS Immune deficiency, fever DQ243968 DQ243944 
6 9 Male HCoV-229E-8/11/82a NTS Fever, URTI DQ243969 DQ243945 
7 4 Female HCoV-229E-8/11/82b NTS URTI, blocked ears DQ243970 DQ243946 
8 35 Female HCoV-229E-29/7/84 NTS URTI DQ243971 DQ243947 
9 48 Male HCoV-229E-5/9/84 NPA Pneumonia DQ243972 DQ243948 
10 12 Female HCoV-229E-24/6/90 NPA Pertussis DQ243973 DQ243949 
ll 29°5 Male HCoV-229E-12/5/92 NPA Viral, URTI DQ243975 DQ243950 
12 0:3 Female HCoV-229E-17/6/92 NPA Bronchiolitis DQ243976 DQ243951 
13 28 Female HCoV-229E-25/6/92 NTS Sore throat, fever, runny nose DQ243974 DQ243952 
14 34 Female HCoV-229E-8/8/01 NTS Fever, cough, myalgia DQ243977 DQ243953 
15 25 Male HCoV-229E-27/8/01 NTS Influenza-like DQ243978 DQ243954 
16 41 Male HCoV-229E-21/6/02 NTSS Influenza-like DQ243979 - 

ly 22 Male HCoV-229E-6/1/03 BAL NA DQ243980 DQ243955 
18 24 Female HCoV-229E-28/2/03 TsIl NA - DQ243960 
19 0-9 Female HCoV-229E-24/4/03 NPAG SARS DQ243981/2 DQ243056 
20 17 Male HCoV-229E-30/7/03 NTS Influenza-like DQ243983 DQ243957 
21 18 Male HCoV-229E- 14/8/03 NTS# Fever, cough, myalgia DQ243984 DQ243959 
22 44 Male HCoV-229E- 19/8/03 NTS Fever, cough, malaise, myalgia DQ243985 DQ243958 
23 42 Female HCoV-229E-25/8/03 NTS Fever, cough, headache DQ243986 DQ243961 
24 26 Female HCoV-229E-20/1/04 NsIl NA - DQ243962 
25 58 Female = HCoV-229E-27/1/04 Sputum$§ = Severe pneumonia DQ243987 - 


*NTS, Nose and throat swab; NPA, nasopharyngeal aspirate; BAL, bronchio-alveolar lavage; TS, throat swab; NS, nose swab. 


+NA, Not available. 

+URTI, Upper respiratory tract infection. 

§Only S gene sequences were obtained from patients 16 and 25. 
llOnly N gene sequences were obtained from patients 18 and 24. 
§Partial S gene sequences were obtained from patients 19 and 25. 
#Influenza virus also detected in this patient. 


changes, four of which were non-conservative, were shared 
by each variant. Four changes clustered in a hot spot 
between aa 224 and 229 (Supplementary Table S3 available 
in JGV Online). Deletions were detected in only two N gene 
sequences. The first was a three base (GGT) in-frame 
deletion at nt position 480 in the virus from patient 3 
resulting in the deletion of glycine at aa 160. The second was 
an A deletion from a string of five As starting at nt 611 in the 
variant from patient 14, producing a frameshift mutation 
and generation of a stop codon 52 aa downstream. This 
sequence was generated by direct RT-PCR from a clinical 
specimen and may be the result of amplification of non- 
infectious viral RNA. 


Phylogenetic analysis of the S gene assigned the 23 variants 
to four distinct groups each containing temporally 
associated viruses (Fig. 1). The groups comprised viruses 


circulating during the periods 1979-1982, 1982-1984, 
1990-1992 and 2001-2004. The expected transition/trans- 
version ratio and gamma distribution parameter alpha were 
estimated as 1-62 and 0-16, respectively, from the dataset 
using TREEPUZZLE (Schmidt et al., 2002). These values were 
used in DNAdist (Kimura) to produce a more accurate tree 
topology. A strong correlation between bootstrap values and 
these groupings was found. The expected transition/ 
transversion ratio and gamma distribution parameter 
alpha values estimated using the N gene dataset were 1-81 
and 0-19, respectively. Phylogenetic analysis of the N gene 
showed clustering of variants similar to that obtained with 
the S gene. However, less sequence variation in the N gene 
made it difficult to resolve the viruses into four groups. 
Rather, two major clusters comprising groups 1 and 2, and 
groups 3 and 4, respectively, were identified (Supplementary 
Fig. S1 available in JGV Online). 
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Table 2. Summary of amino acid changes in the S protein of selected HCoV-229E variants studied. The variants chosen 


represent the groups shown in Fig. 1 


Strain name Amino acid position 

22 26 35 51 88 90 104 109 111 112 113 «121 «#140 159 181 196 210 223 
HCoV-229E1973 L Y Y D L F D S D Vv L L Vv F A S T T 
HCoV-229E-11/6/79 _* H - - S - - - N - S I - - S - ~ N 
HCoV-229E-6/10/82 - H H - S - - Y N A S I N 
HCoV-229E-8/11/82a 9 — H H - S - - - N - S I - - S A - N 
HCoV-229E-29/7/84 M H H - S - - Y N A S I N 
HCoV-229E-24/6/90 M H H - S - - Y N A S I A S - - - N 
HCoV-229E-8/8/01 M Q H N S L - Y N A S I A S - - S N 
HCoV-229E- 14/8/03 M H H N S L A Y N A S I A S - - S N 
Strain name 225 228 229 248 307 309 310 311 312 314 316 318 321 324 342 349 9350 6352 
HCoV-229E1973.E D F S D K P Q S G K F Y G D K Y A 
HCoV-229E-11/6/79 - eT Vv A Y R Vv E - - - 
HCoV-229E-6/10/82 - 7 Vv A - - L - - Vv R - R Vv - - - G 
HCoV-229E-8/11/82a  — ‘ Vv A R Y R Vv E - - - 
HCoV-229E-29/7/84 - x Vv A - - L - - Vv R - R V - - F G 
HCoV-229E-24/6/90 - * Vv A N - L R - Vv R Y R Vv - Q F G 
HCoV-229E-8/8/01 A * Vv A N E L R R P R Y R Vv - Q F G 
HCoV-229E- 14/8/03 A ° Vv A N E Te R R P R Y R Vv - Q F G 
Strain name 353 354 355 356 357 358 401 404 406 407 408 409 410 411 424 430 444 462 
HCoV-229E1973.V Y A N Vv G Vv w y S K Y Y T G Q D Vv 
HCoV-229E-11/6/79 4 _ L - I N S - - - K N I 
HCoV-229E-6/10/82 * . - - F - - 1 - L N S - - Vv K N I 
HCoV-229E-8/11/82a * = L - I N S - - - K N I 
HCoV-229E-29/7/84 * . - - F - - L - L N S - - Vv K N I 
HCoV-229E-24/6/90 - * - - F - M L N L N S H - Vv K N I 
HCoV-229E-8/8/01 % : Vv K F D M L N Te N S H N Vv K N I 
HCoV-229E- 14/8/03 * * Vv K F D M L N L N S H N Vv K N I 
Strain name 465 488 530 642 669 676 681 714 765 775 849 871 937 971 1002 1005 
HCoV-229E1973 D K L R D I T N Vv A S T I T I M 
HCoV-229E-11/6/79 - N - M - - - K A - - I L R - - 
HCoV-229E-6/10/82 - N M M - - - K A S - I - R - - 
HCoV-229E-8/11/82a  — N - M E Vv K A - - I L R Vv - 
HCoV-229E-29/7/84 - N M M - - - K A S - I L R - - 
HCoV-229E-24/6/90 - N M M - - R K A S - I L R - I 
HCoV-229E-8/8/01 - N M M - - R K A S F I I R ~ I 
HCoV-229E- 14/8/03 E N M M - - R K A S - I Ie R ~ I 


*_, Denotes same amino acid as first sequence. 
+*, Denotes amino acid deletion. 


Within the clinical variants there was substantial evidence 
for temporal mutations. Many of these occurred in 1982 and 
persisted, while others, for example D51N, F90L and T2108, 
first occurred in 2001 and were present subsequently in all 
variants. Some mutations (A181S, $196A, D342E, S407], 
D669E, 1676V and 11002V) were selected only briefly and 
then replaced by variants possessing the codon existing 
before that time. At the time of generation of a new codon 
it was often possible to detect the prevailing variant 


co-circulating with the strain containing the new mutation; 
for example, L22 with M22 mutants in 1984 and D104 
with A104 mutants in 2003 (Table 2 and Supplementary 
Table S2 available in JGV Online). 


Alignment of S protein sequences and phylogenetic analysis 
of the S gene revealed several discrete time points where 
amino acid substitutions occurred and were then retained 
during subsequent years. These mutations may represent 
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HCoV-229E-14/8/03 


85 


HCoV-229E-21/6/02 


HCoV-229E-6/1/03 


HCoV-229E-27/8/01 


HCoV-229E-8/8/01 


13-18 


A162 


HCoV-229E- 17/6/92 


HCoV-229E-24/6/90 


HCoV-229E-12/5/92 


HCoV-229E-25/6/92 


HCoV-229E-29/7/84 


HCoV-229E-5/9/84 


HCoV-229E-6/10/82 5 
HCoV-229E-21/10/82 
HCoV-229E-8/11/82a —| 
HCoV-229E-8/11/82b 
HCoV-229E-16/6/82 L—- 4 


HCoV-229E-22/9/82 


HCoV-11/6/79 


HCoV-229E1973 


HCoV-229E-24/4/03 


HCoV-229E-25/8/03 


HCoV-229E-19/8/03 


HCoV-229E-27/1/04 


HCoV-229E-30/7/03 


Fig. 1. Phylogenetic analysis using PHYLIP 
(DNAdist/neighbour-joining, 1000 bootstrap 
replicates) on HCoV-229E isolates and 
strains identified between 1979 and 2004. 
The DNA sequence of the entire coding 
region of the HCoV-229E S gene was ana- 
lysed. Sequences in bold were obtained 
through GenBank and are strains detected 
in geographical locations other than Australia. 
Significant bootstrap values (>80%) and 
the clustering of viruses analysed into groups 
(1-4) are indicated. 


the outcome of natural evolution of the virus in the context 
of antibody pressure. Comparing rates of dS and dN 
nucleotide substitutions can test three forms of evolutionary 
selection on a gene. Equal dS and dN substitution rates 
demonstrate an absence of selection, whereas an excess of 
dS substitutions indicates purifying selection, operating to 
preserve protein structure and function. Excess dN sub- 
stitution rates indicate positive selection and occur mainly 
in genes with adaptive functions (Bush, 2001). Most positively 
selected genes previously identified have been pathogen 
surface proteins (Yang & Bielawski, 2000). To avoid 
recognition by antibodies generated in response to prior 
antigen exposure, surface proteins of pathogens must 
change their structure. Hence, it is likely that evasion of 
the host immune system drives repeated amino acid replace- 
ments in surface proteins (Bush, 2001). Using the Nei— 
Gojobori method for estimating statistical differences 
between dS and dN substitution rates (Kumar et al., 2004), 
we confirmed that the HCoV-229E S gene sequences had 


undergone positive selection over time. Probabilities of 0-00 
were obtained for purifying and neutral selection and both 
hypotheses were rejected. A probability of 1-0 was obtained 
when testing for positive selection and this hypothesis was 
therefore accepted. 


We could not show evidence for the emergence of distinct 
subtypes of HCoV-229E as recently reported for HCoV- 
OC43, where strains circulating in Belgium between 2003 and 
2004 showed sufficient unrelatedness at the amino acid level 
(3-1%) to suggest the existence of two genetically distinct 
strains (Vijgen et al., 2005). For the HCoV-229E variants we 
studied, the overall amino acid difference between the 1979 
strain and the 2004 strain was 3-3 %. When apparent anti- 
genic drift occurred, the extent of amino acid difference 
between the old variant and its replacement was less than 1 %. 


When the HCoV-229E S gene sequences from this study 
were compared to 26 sequences available through GenBank, 
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only two variants were similar. The first, strain A162 
(accession no. Y10051), was isolated in 1995 from an adult 
in Ghana (Hays & Myint, 1998). While the submitted 
sequence was 483 nt short of full-length, it grouped phylo- 
genetically with coronaviruses circulating in Australia 
between 1990 and 1992 (Fig. 1). Also similar was strain 
13-18, first isolated in 1999 in the USA. Although the 
sequence available was less than half that of the complete S 
gene, this virus grouped with variants circulating between 
2001 and 2004 in Australia (Fig. 1). Hence, the time of 
circulation of distinct HCoV-229E variants in south-eastern 
Australia generally coincided with that of variants circulat- 
ing elsewhere. 


Overall, sequencing and analysis of the S and N gene 
products of HCoV-229E strains circulating in Victoria, 
Australia, between 1979 and 2004 has provided the first 
evidence for genetic drift and positive selection as part of the 
evolution of this virus. The similarity between those strains 
circulating in Victoria and a small number of strains 
identified in other geographical locations at similar times 
indicates that HCoV-229E, despite having the potential, has 
not undergone major recombination events since it was first 
isolated in 1967. 
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